Signi cance of Interspecies Matches when Evolutionary Rate Varies

نویسندگان

  • JIA LI
  • WEBB MILLER
چکیده

We develop techniques to estimate the statistical signii cance of gap-free alignments between two genomic DNA sequences, using human–mouse alignments as an example. The sequences are assumed to be suff ciently similar that some but not all of the neutrally evolving regions (i.e., those under no evolutionary constraint) can be reliably aligned. Our goal is to model the situation in which the neutral rate of evolution, and hence the extent of the aligning intervals, varies across the genome. In some cases, this permits the weaker of two matches to be judged as less likely to have arisen by chance, provided it lies in a genomic interval with a high level of background divergence. We employ a hidden Markov model to capture variations in divergence rates and assign probability values to gap-free alignments using techniques of Dembo and Karlin, which are related to those used for the same purpose by BLAST. Our methods are illustrated in detail using a 1.49 Mb genomic region. Results obtained from the analysis of human chromosome 22 using these techniques are also provided.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Signi cance of Locality and Selection Pressure in the Grand Deluge Evolutionary Algorithm

This paper presents the results of a parameter study of the Grand Deluge Evolutionary Algorithm, whose special features consist of local interactions between individuals within a spatially structured population and a self{adjusting control mechanism of the selection pressure. Since both ingrediences are parametrizable this study aims at the identi cation of the signi cance and sensitivity of th...

متن کامل

On the statistical signi cance of temporal ring patterns in multi-neuronal spike trains

Repeated occurrences of serial ring sequences of a group of neurons with xed time delays between neurons are observed in many experiments involving simultaneous recordings from multiple neurons. Such temporal patterns are potentially indicative of underlying microcircuits and it is important to know when a repeatedly occurring pattern is statistically signi cant. These sequences are typically i...

متن کامل

A simple method to estimate the signi®cance level of the catch probability in the catch removal method in river ®sh populations

This work presents a method for estimating the signi®cance level of the capture probability when the capture removal method is used in riverine ®sh populations. The method is based on adjustment of the linear relationship between capture probability and an index of capture ef®cacy. With this method the population size, the statistic 2 and the signi®cance level of the capture probability can be ...

متن کامل

The Future of Chess - Playing Technologies and the Signi cance ofKasparov Versus Deep

In this paper we argue that the recent Garry Kasparov vs. Deep Blue matches are signiicant for the eld of artiicial intelligence in several ways, including providing an example of valuable baseline benchmarks for more complex alternatives to contrast and justify themselves. We will also brieey summarize some of the latest developments on computer chess research and highlight how our own work on...

متن کامل

Compression , Signi cance and Accuracy

Stephen Muggleton The Turing Institute, 36 North Hanover Street, Glasgow G1 2AD, UK Ashwin Srinivasan The Turing Institute, 36 North Hanover Street, Glasgow G1 2AD, UK Michael Bain The Turing Institute, 36 North Hanover Street, Glasgow G1 2AD, UK Abstract Inductive Logic Programming (ILP) involves learning relational concepts from examples and background knowledge. To date all ILP learning syst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003